Open-vocabulary spoken document retrieval based on new subword models and subword phonetic similarity
نویسندگان
چکیده
A new type of video retrieval system is proposed that identifies a target video section by searching for a word passage submitted as a quoted speech or text query. The proposed system has two unique characteristics. The first characteristic is that it is based on subword models such as phonemes, syllables, and morphemes so the system is able to deal with any type of query, including new words and personal names. The second characteristic is that the system relies on acoustic similarity between subword models. Furthermore, new subword models were constructed for the retrieval system to improve performance. The new models were based on two concepts: context-dependent models and more sophisticated in the time axis than phone models. Through experimentation, the effectiveness and scope of the proposed spoken document retrieval system were confirmed, and suitable subword models for the proposed method discussed.
منابع مشابه
An Investigation of Subword Unit Representations for Spoken Document Retrieval
This study investigates the feasibility of using subword unit representations for spoken document retrieval as an alternative to using words generated by either keyword spotting or word recognition. Our investigation is motivated by the observation that word-based retrieval approaches face the problem of either having to know the keywords to search for a priori, or requiring a very large recogn...
متن کاملMultilayer subword units for open-vocabulary spoken document retrieval
This paper describes the application of subword units in an effort of improving open-vocabulary spoken document retrieval performance in the case of highly corrupted recognition output. This paper presents the developed open-vocabulary spoken document retrieval system including the newly proposed subphonetic segment unit and combining multilayer subword units. Our experiments on Japanese spoken...
متن کاملAn integration method of retrieval results using plural subword models for vocabulary-free spoken document retrieval
Spoken document retrieval (SDR) systems must be vocabulary-free in order to deal with arbitrary query words because a user often searches the section where a query word is spoken, and query words are liable to be special terms that are not included in a speech recognizer’s dictionary. We have previously proposed new subword models, such as the 1/2 phone model, the 1/3 phone model, and the sub-p...
متن کاملSubword unit representations for spoken document retrieval
This paper investigates the feasibility of using subword unit representations for spoken document retrieval as an alternative to using words generated by either keyword spotting or word recognition. Our investigation is motivated by the observation that word-based retrieval approaches face the problem of either having to know the keywords to search for a priori, or requiring a very large recogn...
متن کاملAn STD system for OOV query terms using various subword units
We have been proposing a Spoken Term Detection (STD) method for Out-Of-Vocabulary (OOV) query terms using various subword units, such as monophone, triphone, demiphone, one third phone, and Sub-phonetic segment (SPS) models. In the proposed method, subword-based ASR is performed for all spoken documents and subword recognition results are generated using subword acoustic models and subword lang...
متن کامل